feat(v2): metastore index #3586
Conversation
```go
Config *Config

partitionMu      sync.Mutex
loadedPartitions *lru.Cache[PartitionKey, *indexPartition]
```
I think the cache should be per partition+tenant to avoid interference
If you mean using partition+tenant as the key, that would also suffer from interference (busy tenants would push others out of the cache). For real separation we could maintain separate caches per tenant (complex, costly) or go back to the previous solution (an unbounded TTL-based cache, with uneven memory usage).
As discussed separately, this cache is also not suitable because a large query can unload the active (for writes) partition. That could again be solved by switching to a different caching strategy (ARC, 2Q, etc.).
Personally, given our usage patterns (frequent writes, infrequent reads), I lean towards a custom solution similar to what we had before: an upper bound on how many items stay in memory, plus explicit checks that prevent the write partition from being unloaded.
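One way to read this suggestion is a bounded LRU-style cache that skips pinned entries during eviction, so the active write partition can never be pushed out by a read-heavy query. A minimal sketch under those assumptions (all names and sizes here are illustrative, not the actual PR code):

```go
package main

import (
	"container/list"
	"fmt"
)

// partitionCache is a hypothetical bounded cache whose eviction skips
// pinned keys (e.g. the active write partition).
type partitionCache struct {
	maxSize int
	order   *list.List               // front = most recently used
	items   map[string]*list.Element // key -> list element
	pinned  map[string]bool          // keys that must stay resident
}

type entry struct {
	key   string
	value any
}

func newPartitionCache(maxSize int) *partitionCache {
	return &partitionCache{
		maxSize: maxSize,
		order:   list.New(),
		items:   make(map[string]*list.Element),
		pinned:  make(map[string]bool),
	}
}

func (c *partitionCache) Pin(key string)   { c.pinned[key] = true }
func (c *partitionCache) Unpin(key string) { delete(c.pinned, key) }

func (c *partitionCache) Get(key string) (any, bool) {
	el, ok := c.items[key]
	if !ok {
		return nil, false
	}
	c.order.MoveToFront(el)
	return el.Value.(*entry).value, true
}

func (c *partitionCache) Put(key string, value any) {
	if el, ok := c.items[key]; ok {
		el.Value.(*entry).value = value
		c.order.MoveToFront(el)
		return
	}
	c.items[key] = c.order.PushFront(&entry{key, value})
	// Evict from the back, skipping pinned entries so the write
	// partition is never unloaded by a large read.
	for c.order.Len() > c.maxSize {
		el := c.order.Back()
		for el != nil && c.pinned[el.Value.(*entry).key] {
			el = el.Prev()
		}
		if el == nil {
			return // everything resident is pinned; allow overshoot
		}
		delete(c.items, el.Value.(*entry).key)
		c.order.Remove(el)
	}
}

func main() {
	c := newPartitionCache(2)
	c.Put("write-partition", 1)
	c.Pin("write-partition")
	c.Put("read-a", 2)
	c.Put("read-b", 3) // evicts read-a, not the pinned write partition
	_, ok := c.Get("write-partition")
	fmt.Println(ok) // true
	_, ok = c.Get("read-a")
	fmt.Println(ok) // false
}
```

The same effect could be achieved with `lru.Cache` plus an external "do not evict" check, but a custom structure makes the pinning invariant explicit.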
```go
	s.tenants[b.TenantId] = ten
}

ten.blocks[b.Id] = b
```
We should probably reject the insert if the block is already present: first write wins, since block meta is immutable.
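A first-write-wins insert could look like the sketch below. The `tenantIndex`/`blockMeta` types and the error value are simplified placeholders for illustration, not the actual types from the PR:

```go
package main

import (
	"errors"
	"fmt"
)

// Simplified stand-ins for the real metastore types.
type blockMeta struct {
	Id       string
	TenantId string
}

type tenantIndex struct {
	blocks map[string]*blockMeta
}

var errBlockExists = errors.New("block already present")

// insertBlock rejects duplicates: block meta is immutable, so the
// first write wins and a repeated insert is an error.
func (t *tenantIndex) insertBlock(b *blockMeta) error {
	if _, ok := t.blocks[b.Id]; ok {
		return errBlockExists
	}
	t.blocks[b.Id] = b
	return nil
}

func main() {
	ten := &tenantIndex{blocks: make(map[string]*blockMeta)}
	fmt.Println(ten.insertBlock(&blockMeta{Id: "b1"})) // <nil>
	fmt.Println(ten.insertBlock(&blockMeta{Id: "b1"})) // block already present
}
```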
```go
for _, t := range i.store.ListTenants(meta.Key, s) {
	te := &indexTenant{
		blocks: make(map[string]*metastorev1.BlockMeta),
	}
	for _, b := range i.store.ListBlocks(meta.Key, s, t) {
		te.blocks[b.Id] = b
	}
	sh.tenants[t] = te
}
```
I believe we should avoid loading all tenants for a partition (I just can't find a use case)
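One alternative consistent with this comment is loading tenants lazily: populate a tenant's blocks from the store only on first access, instead of eagerly listing every tenant in the shard. A hypothetical sketch (the `store` interface and types here are simplified placeholders):

```go
package main

import "fmt"

type blockMeta struct{ Id string }

// store is a placeholder for the metastore's persistent index store.
type store interface {
	ListBlocks(shard uint32, tenant string) []*blockMeta
}

type indexTenant struct {
	blocks map[string]*blockMeta
}

type indexShard struct {
	store   store
	shard   uint32
	tenants map[string]*indexTenant
}

// getTenant loads a single tenant from the store on first access and
// caches it, so untouched tenants are never read.
func (s *indexShard) getTenant(tenant string) *indexTenant {
	if te, ok := s.tenants[tenant]; ok {
		return te
	}
	te := &indexTenant{blocks: make(map[string]*blockMeta)}
	for _, b := range s.store.ListBlocks(s.shard, tenant) {
		te.blocks[b.Id] = b
	}
	s.tenants[tenant] = te
	return te
}

// fakeStore counts reads so we can observe the lazy behaviour.
type fakeStore struct{ calls int }

func (f *fakeStore) ListBlocks(shard uint32, tenant string) []*blockMeta {
	f.calls++
	return []*blockMeta{{Id: tenant + "-block"}}
}

func main() {
	fs := &fakeStore{}
	sh := &indexShard{store: fs, tenants: map[string]*indexTenant{}}
	sh.getTenant("tenant-a")
	sh.getTenant("tenant-a") // cached; no second store read
	fmt.Println(fs.calls)    // 1
}
```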
```go
func (i *Index) tryDelete(key PartitionKey, shard uint32, tenant string, blockId string) (*metastorev1.BlockMeta, *PartitionMeta, bool) {
	meta := i.findPartitionMeta(key)
	if meta == nil {
		return nil, nil, false
	}

	p := i.getPartition(meta)
```
I think we shouldn't load partition from disk here:
- If partition (its part you want to modify) is in memory – update it.
- Delete the block from store. (and make sure we created tombstone for the block – out of scope for the PR)
Agreed. We still need to find the partition for the block in order to delete it, but there is no need to load the partition itself.
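The flow agreed on here can be sketched as: update the in-memory copy only if the partition is already resident, and always delete from the store without loading anything from disk. This is a simplified illustration with placeholder types, not the PR's implementation:

```go
package main

import "fmt"

type blockMeta struct{ Id string }

// index holds resident partitions in memory; stored stands in for the
// persistent block store.
type index struct {
	loaded map[string]map[string]*blockMeta // partition key -> block id -> meta
	stored map[string]bool                  // persisted block ids
}

// deleteBlock removes a block from the store and, only if the owning
// partition happens to be in memory, from the in-memory copy too.
// It never loads a partition from disk just to delete a block.
// (Tombstone creation is out of scope, as noted above.)
func (i *index) deleteBlock(partition, blockId string) bool {
	if !i.stored[blockId] {
		return false
	}
	if p, ok := i.loaded[partition]; ok {
		delete(p, blockId) // partition is resident: keep memory consistent
	}
	delete(i.stored, blockId)
	return true
}

func main() {
	idx := &index{
		loaded: map[string]map[string]*blockMeta{},
		stored: map[string]bool{"b1": true},
	}
	fmt.Println(idx.deleteBlock("p1", "b1")) // true: store-only delete, partition not loaded
	fmt.Println(idx.deleteBlock("p1", "b1")) // false: already gone
}
```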
Introduces the metastore/index package, which adds a time-partitioned block index and integrates it with the existing flows for adding, compacting, and querying blocks. This is a draft with a few parts missing (some of which are marked with FIXME and TODO comments). It is a breaking change meant for the v2 work stream: once merged, existing blocks will not be reachable.